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Abstract.  Executing  complex  plans  for  coordinating  the  behaviors  of 
multiple  heterogeneous  agents  often  requires  setting  several  parameters. 
For  example,  we  are  developing  a  decision  aid  for  deploying  a  set  of  au¬ 
tonomous  vehicles  to  perform  situation  assessment  in  a  disaster  relief 
operation.  Our  system,  the  Situated  Decision  Process  (SDP),  uses  pa¬ 
rameterized  plans  to  coordinate  these  vehicles.  However,  no  model  exists 
for  setting  the  values  of  these  parameters.  We  describe  a  case-based  rea¬ 
soning  solution  for  this  problem  and  report  on  its  utility  in  simulated 
scenarios,  given  a  case  library  that  represents  only  a  small  percentage 
of  the  problem  space.  We  found  that  our  agents,  when  executing  plans 
generated  using  our  case-based  algorithm  on  problems  with  high  uncer¬ 
tainty,  performed  significantly  better  than  when  executing  plans  using 
baseline  approaches. 
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1  Introduction 

Real-world  plans  can  be  complex;  they  may  require  many  parameters  for  coor¬ 
dinating  multiple  agents,  resources,  and  decision  points.  Furthermore,  multiple 
performance  metrics  may  be  used  to  assess  a  plan’s  execution,  and  may  involve 
trade  offs.  For  example,  we  consider  the  problem  of  how  to  deploy  a  team  of 
autonomous  unmanned  vehicles  (AUVs),  managed  by  a  human  operator,  to  con¬ 
duct  situation  assessment  in  preparation  for  a  Humanitarian  Assistance/Disaster 
Relief  (HADR)  mission.  The  team  includes  a  heterogeneous  set  of  robotic  plat¬ 
forms  that  vary  in  their  capabilities.  If  we  want  to  minimize  vehicle  energy  con¬ 
sumption  while  maximizing  coverage  of  the  area  surveyed,  how  many  vehicles 
(and  of  what  type)  should  we  deploy,  and  how  should  they  behave?  If  we’re  not 
conservative,  we  may  expend  too  many  resources  (vehicles  or  energy),  reducing 
our  capability  to  respond  to  other  emergencies  in  the  near-term.  Likewise,  we 
run  the  risk  of  poor  performance  if  too  few  resources  are  deployed,  which  may 
have  serious  consequences  to  the  effected  civilians. 

This  problem  is  not  unique  to  military  mission  planning,  as  similar  deci¬ 
sions  must  be  made  for  many  other  types  of  resource-bounded  tasks  such  as 
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emergency  first  response,  sports  team  financial  management,  and  local  govern¬ 
ment  budget  planning.  In  each  case,  parameterized  plans  may  exist  to  solve  a 
complex  problem,  but  how  should  their  parameters  be  set?  In  some  situations 
(such  as  ours),  this  problem  can  be  compounded  when  the  model  for  mapping 
parameter  settings  to  expected  performance  is  incomplete.  While  such  models 
may  be  acquired,  this  requires  access  to  problem-solving  expertise  or  a  massive 
database  that  records  problem  attributes,  parameter  settings,  and  the  resulting 
performance  metrics.  Unfortunately,  we  lack  both  for  our  task. 

We  describe  a  case-based  approach  to  solve  this  problem  where  our  parame¬ 
ters  include  the  number  and  types  of  AUVs  to  send  along  with  algorithm  specific 
options  for  surveying  an  area,  and  report  on  its  utility  in  an  initial  empirical 
study.  Our  algorithm  uses  a  similarity-weighted  vote  to  set  parameter  values, 
and  generates  a  single  score  for  a  set  of  performance  metrics.  We  found  that 
our  approach  generates  plans  that  performed  well  in  simulation  studies  versus 
baseline  approaches,  particularly  when  state  uncertainty  is  high. 

We  describe  related  work  in  Section  2,  then  propose  how  AUVs  can  sup¬ 
port  HADR  missions  in  Section  3.  In  Section  4,  we  overview  the  Situated  Deci¬ 
sion  Process  (SDP),  which  defines  a  role  for  case-based  parameter  selection.  We 
present  our  case-based  algorithm  in  Section  5,  report  its  application  in  simulated 
scenarios  in  Section  6,  and  discuss  the  results  in  Section  7  before  concluding. 

2  Related  Work 

We  focus  on  the  problem  of  setting  the  values  for  multiple  parameters,  which 
are  then  used  by  a  goal  reasoning  (GR)  module  to  assist  with  multi- agent  plan 
generation.  CBR  has  previously  been  used  for  parameter  setting.  For  exam¬ 
ple,  Wayland  [16]  is  a  deployed  case-based  system  used  to  select  the  parame¬ 
ter  values  for  an  aluminum  die-casting  machine.  It  uses  a  type-constrained  and 
feature-weighted  1-nearest  neighbor  rule  for  retrieval  and  a  rule-based  approach 
for  adaptation.  Unlike  our  approach,  Wayland’s  cases  do  not  include  specific 
performance  outcome  data,  nor  use  them  for  case  reuse. 

Several  CBR  systems  set  parameter  values  while  interacting  with  other  rea¬ 
soning  modules,  though  to  our  knowledge  ours  is  the  first  to  supply  them  to  a 
GR  module  for  controlling  multiple  AUVs  in  coordinated  plans.  Weber  et  al.  [19] 
uses  artificial  neural  networks  to  model  software  programs  and  biological  sys¬ 
tems,  resulting  in  a  case  representation  of  problem,  solution,  and  outcome  similar 
to  ours.  Genetic  algorithms  are  used  to  learn  the  initial  cases,  which  are  then 
clustered  using  their  solutions  and  given  to  a  discriminant  analysis  technique  to 
find  select  problem  features.  While  they  are  concerned  with  one  outcome  metric 
we  focus  on  a  multi-objective  problem,  and  using  GAs  may  be  infeasible  in  our 
domain  due  to  long  run  times.  Jin  and  Zhu  [5]  use  CBR  to  set  the  initial  param¬ 
eter  values  for  an  injection  molding  process.  These  settings  are  repeatedly  tested 
and  modified  by  a  fuzzy  reasoner  until  all  defects  are  eliminated,  at  which  time 
a  new  case  is  stored.  While  not  emphasized  in  this  paper,  we  use  CBR  to  repeat¬ 
edly,  rather  than  only  initially,  recommend  parameter  settings  throughout  plan 


execution.  Montani  [11]  uses  CBR  to  set  the  parameters  for  multiple  systems, 
including  a  rule-based  reasoner  used  to  modify  therapies  for  diabetes  patients. 
We  focus  on  multi-agent  planning  rather  than  rule  revision,  and  we  focus  on  the 
control  of  AUV  teams  rather  than  a  health  sciences  application.  Finally,  Pavon 
et  al.  [15]  use  CBR  to  revise  Bayesian  network  models  for  setting  the  control 
parameters  of  a  genetic  algorithm  that  performs  root  identification  for  geomet¬ 
ric  problems.  In  contrast,  our  CBR  algorithm  uses  a  performance-based  voting 
procedure  rather  than  abduction  to  set  parameters. 

Jaidee  et  al.  [3]  also  integrated  CBR  with  a  GR  module  for  team  coordination 
planning.  However,  they  use  CBR  to  formulate  goals  while  we  use  it  to  set 
parameter  values,  and  we  focus  on  robotic  control  rather  than  video  games. 

Other  have  studied  CBR  for  robotics  applications.  For  example,  Likhachev 
et  al.  [7]  use  CBR  to  learn  parameter  settings  for  the  behavior-based  control  of 
a  ground  robot  in  environments  that  change  over  time.  Their  approach  learns 
and  adapts  cases,  effectively  searching  the  behavior  parameter  space,  until  good 
performance  is  obtained.  While  their  work  focuses  on  motion  control  for  a  single 
robot,  we  instead  focus  on  the  high-level  control  of  robot  teams.  Karol  et  al.  [6] 
also  focus  on  robot  team  coordination  (for  RoboCup  soccer).  They  use  CBR  to 
select  actions  that  are  transformed  to  motion  control  parameters.  Ros  et  al.  [18] 
also  focus  on  action  selection  for  RoboCup  soccer,  and  use  a  sophisticated  repre¬ 
sentation  and  reasoning  method.  Of  interest  to  us  is  that  each  agent  can  behave 
independently  and  abort  the  plan,  which  is  also  essential  in  our  domain  because 
unexpected  state  changes  could  have  catastrophic  consequences.  However,  this 
body  of  research  focuses  on  motion  planning  for  relatively  short-term  behaviors, 
whereas  we  focus  on  longer  duration  plans  that  are  monitored  by  a  GR  module. 

Finally,  CBR  has  been  studied  for  several  military  applications,  including 
those  involving  disaster  response.  For  example,  Abi-Zeid  et  al.  [1]  studied  inci¬ 
dent  prosecution,  including  real  time  support  for  situation  assessment  in  search 
and  rescue  missions.  Their  prototype  system,  ASISA,  uses  CBR  to  select  hierar¬ 
chical  information-gathering  plans  for  situation  assessment.  Munoz-Avila  et  al.’s 
[12]  HICAP  instead  uses  a  conversational  CBR  system  to  assist  operators  with 
refining  tasks  in  support  of  noncombatant  evacuation  operations.  SiN  [13]  ex¬ 
tends  their  work,  integrating  a  planner  to  automatically  decompose  tasks  where 
feasible.  However,  while  these  systems  use  planning  modules  to  support  rescue 
operations,  they  do  not  set  parameters  for  multiagent  plans,  nor  focus  on  coor¬ 
dinating  robot  team  behaviors. 


3  Domain:  Military  HADR  Operations 

HADR  operations  [14]  are  performed  by  several  countries  in  response  to  events 
such  as  Hurricane  Katrina  (August  2005),  the  Haiti  earthquake  (2010),  and  Ty¬ 
phoon  Haiyan  (November  2013).  Before  support  operations  can  arrive  an  initial 
wave  of  responders  must  gather  required  information  about  the  impact  zone  (e.g., 
infrastructure  status,  suggested  ingress  and  evacuation  routes,  and  survivor  loca¬ 
tions)  .  Current  operations  employ  remotely  controlled  drones  and  human-piloted 


helicopters  to  gather  this  information.  We  instead  propose  deploying  a  hetero¬ 
geneous  team  of  AUVs  with  appropriate  sensor  platforms  to  automate  much  of 
this  process,  so  as  to  reduce  time  and  cost.  We  claim  that  this  should  enable 
responders  to  perform  critical  tasks  more  quickly  for  HADR  operations. 

In  this  paper  we  focus  a  module  of  the  SDP,  which  we  are  developing  to 
assist  with  HADR  operation.  Under  a  human  operator’s  guidance,  the  SDP  will 
deploy  a  team  of  heterogeneous  AUVs,  coordinated  by  a  GR  module  to  identify 
which  goals  need  to  be  accomplished,  and  deploy  the  AUVs  to  best  achieve  them 
[17].  To  perform  this  task  the  GR  module  needs  to  generate,  compare,  schedule, 
and  dispatch  plans  for  achieving  these  goals.  Plans  vary  in  their  performance 
metrics  based  on  resource  allocation  (e.g.,  of  AUVs  and  their  search  algorithm), 
and  selecting  among  them  requires  the  GR  to  deliberate  about  their  predicted 
performance  (e.g.,  to  maximize  search  area  coverage  and  minimize  energy  con¬ 
sumption)  .  To  do  this,  we  use  a  case-based  algorithm  to  select  parameter  settings 
for  the  GR  module,  where  cases  associate  problems  with  solutions  (i.e. ,  param¬ 
eter  settings)  and  their  performance  metrics.  This  enables  the  SDP  to  propose 
multiple  solutions  for  a  goal  by  optimizing  on  different  metrics. 


4  Simulating  the  Situated  Decision  Process  (SDP) 

To  provide  context  for  our  work  on  case-based  parameter  selection,  we  briefly  de¬ 
scribe  the  SDP’s  modules,  the  simulated  AUVs  that  it  will  control,  their  sensing 
models  and  search  algorithms,  and  scenario  performance  metrics. 

4.1  SDP  Modules 

Figure  1  highlights  SDP’s  primary  modules.  It  will  support  a  Forward  Air  Con¬ 
troller  (FAC)  in  surveying  and  assessing  Areas  of  Interest  (Aols)  of  a  HADR 
mission.  The  FAC  will  provide  mission  objectives  and  constraints  using  a  Human- 
Robot  Interface,  whose  GUI  will  permit  highlighting  of  Aols,  important  assets, 
and  related  information.  These  will  be  provided  to  a  GR  module  [3],  which  will 
help  to  decompose  the  given  objectives.  This  will  result  in  selecting  goals  and 
conditions  for  synthesizing  a  finite  state  automaton  to  control  the  motions  of 
the  AUVs.  Our  Goal  Reasoner  depends  on  a  Case-Based  Parameter  Selector  to 
recommend  parameter  settings  given  geographical  regions  and  constraints. 

4.2  Simulation  Platforms 

HADR  missions  begin  with  generating  a  new  road  map,  as  a  common  feature  of 
disasters  is  the  collapse  of  buildings  or  washing  out  of  roads.  We  propose  to  use 
three  AUV  types  (Figure  2),  which  vary  in  their  motion  properties,  to  perform 
infrastructure  assessment.  The  number  of  each  type  corresponds  to  a  parameter 
that  needs  to  be  set  in  HADR  mission  plans. 

From  the  air,  the  roadways  that  are  still  intact  are  visible  to  downward  look¬ 
ing  cameras,  which  we  assume  are  carried  at  relatively  high  altitude  (>  1000m) 
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Fig.  1.  The  Situated  Decision  Process  (SDP) 


(a)  STUAS  (b)  UGV  (c)  MAY 


Fig.  2.  Platforms  Simulated  in  our  Study  (Acknowledgements:  (A)  US  Marines,  Shan¬ 
non  Arledge;  (B)  US  Navy,  Elizabeth  R.  Allen;  (C)  NASA,  Sean  Smith) 

by  small  tactical  unmanned  aircraft  systems  (STUAS)  and  at  low  altitudes 
(<  100m)  by  multirotor  air  vehicles  (MAVs).  From  the  ground,  the  best  road¬ 
mapping  sensor  is  a  3D  omnidirectional  lidar  mounted  on  unmanned  ground 
vehicles  (UGVs). 

We  model  AUVs  as  nonholomonic  physicomimetic  agents  [10].  This  allows  us 
to  run  them  simultaneously  in  the  MASON  [9]  multi-agent  simulator  to  compare 
the  performance  of  teams  that  vary  in  their  numbers  of  each  AUV  type. 

4.3  Sensing  model 

Downward  facing  cameras  and  scanning  lidars  take  measurements  of  the  region 
surrounding  an  agent  as  shown  in  Figure  3a.  For  flying  agents,  the  width  d  of 
the  sampled  area  depends  on  the  agent’s  altitude  h  and  the  field  of  view  of  the 
camera  a.  For  UGVs,  d  is  the  maximum  effective  range  of  their  lidar.  Both  of 
these  sensors  are  data-rich  and  information-poor,  and  so  rather  than  modeling 
transmission  of  whole  images,  we  assume  they  segment  their  sensed  area  into 


(a)  Top  View  (b)  Front  View 

Fig.  3.  Schematic  of  Downward-Facing  Camera  Image  Segmented  into  Discrete  Regions 


discrete  grid  cells  dm,in  meters  across,  where  dmin  is  half  the  expected  width  of 
the  target  signal  (e.g.,  roads).  The  agent’s  speed  u  and  measurement  interval 
Atm  determine  the  minimum  depth  of  the  simulated  sensor  grid.  We  assume  a 
square  grid  of  d  =  60m  and  dmin  =  6m  for  all  agents  in  our  simulator. 

At  each  time  step,  where  Atm  =  .Is,  our  simulated  sensors  return  a  Boolean 
grid  that  classifies  each  cell  based  on  whether  it  contains  the  target  signal.  Each 
sensor  updates  an  estimate  of  the  percentage  of  coverage  of  a  set  of  global  grid 
cells  using  a  one-dimensional  Kalman  filter,  which  fuses  data  from  multiple  plat¬ 
forms  (whose  sensor  views  overlap) .  A  Kalman  filter  allows  us  to  control  the  rate 
at  which  cell  uncertainty  can  grow  or  be  reduced  with  guaranteed  and  predictable 
rates  of  convergence  of  each  cell  to  its  mean  value. 


4.4  Area  Coverage  Search  Algorithm 

We  use  three  parameters  that  can  affect  how  the  AUVs  search  a  geographical 
area  during  HADR  missions,  where  we  assume  they  all  begin  a  scenario  near  a 
single,  randomly  selected  point  and  follow  a  grazing  area  coverage  algorithm  [8] . 
Our  first  parameter  determines  how  a  simulated  AUV  searches.  Greedy  search 
applies  a  physicomimetics  force  to  drive  each  agent  towards  the  nearest  “food” , 
here  simulated  by  global  grid  cell  covariance.  In  contrast,  Segmented  search, 
depicted  in  Figure  4,  guides  each  agent  towards  the  nearest  “food”  in  its  own 
Voronoi  cell.  Our  second  parameter,  On  Demand  Recharging ,  is  true  iff  the  AUVs 
can  automatically  recharge  themselves.  Finally,  if  the  third  parameter  Mobile 
Charger  is  true,  then  a  mobile  base  station  follows  the  AUVs. 


4.5  Scenario  Metrics 

Time  is  critical  in  HADR  missions,  both  for  reaching  survivors  and  collecting 
information  to  ensure  that  aid  is  properly  distributed  when  it  arrives.  However, 
the  infrastructure  damage  that  needs  to  be  detected  often  makes  resources  such 
as  electricity  and  fuel  more  precious.  Thus,  we  define  the  following  performance 
metrics  to  assess  agent  performance  in  our  simulation  studies: 

1.  Coverage:  Percentage  of  the  Aols  searched 


Fig.  4.  Voronoi  Partition  of  a  Search  Area  among  Multiple  MAV  Agents 
2.  Energy  Consumption  (in  Joules):  Total  energy  consumed 

We  measure  Coverage  over  a  HADR  scenario’s  entire  duration,  which  we  set 
to  30  minutes  in  our  empirical  study  because  we  expect  HADR  personnel  to 
arrive  within  30  minutes  after  the  AUVs  are  deployed.  Energy  Consumption  is 
calculated  uniformly  across  fuel  and  battery  types  used  to  power  the  AUVs. 

In  our  evaluation  (Section  6),  we  also  use  the  compound  Efficiency  metric, 
defined  as  the  ratio  of  Coverage  to  Energy  Consumption.  We  found  that  it  is  a 
better  heuristic  (than  the  other  two  metrics)  for  guiding  parameter  selection. 

5  Case-Based  Parameter  Selection 

We  use  a  case-based  algorithm  in  the  SDP  to  select  parameter  settings  for  HADR 
scenarios.  We  describe  our  case  representation,  including  a  case’s  recommended 
parameter  settings,  in  Section  5.1.  Our  algorithm  employs  case  retrieval  and 
reuse,  as  described  in  Sections  5.2  and  5.3,  but  does  not  perform  case  revision 
or  retention,  which  we  leave  for  future  work. 

5.1  Case  Representation 

We  represent  a  case  C  =  (P,  S,  O)  as  a  vector  of  attributes  denoting  a  prob¬ 
lem  P,  its  multi-dimensional  solution  S,  and  the  multi-dimensional  performance 
outcome  O  recorded  when  executing  a  HADR  mission  plan  with  parameters  S 
to  the  scenario  summarized  by  P.  Table  1  details  this  representation. 

The  attributes  of  P  characterize  the  HADR  scenario  and  are  used  to  define 
case  similarity  (see  Section  5.2).  P’s  parameters  are  predominately  focused  on 
geometric  features  given  the  task  of  surveying  an  area.  These  include  the  number 
of  disjoint  Areas  of  Interest  (Aols)  in  the  scenario,  the  distance  between  opposite 
corners  of  the  (rectangular)  Area  of  Operations  (AO),  the  total  area  sizes  of  the 
Aols,  and  Coverage  Decay,  a  binary  parameter  indicating  whether  the  sensor  in¬ 
formation  for  a  grid  cell  decays  over  time  and  becomes  obsolete  (i.e.,  uncertainty 
increases).  This  allows  us  to  model  changes  to  the  environment,  such  as  when 
a  fire  spreads  across  grid  cells.  (Evaluating  the  effectiveness  of  these  features 
versus  others  is  left  for  future  research.) 


Table  1.  Case  Representation  used  for  SDP  Parameter  Selection 


Case  Component 

Attribute  Name 

Value  Range 

Problem  (P) 

#  Disjoint  Aols 

(1,  io] 

AO  Diagonal  Distance 

[424,  1980] 

Total  Area  Size  of  Aols 

[500,  1,568,000]  m2 

Coverage  Decay 

{l=true,  0=false} 

Solution  ( S ) 

#  STUASs 

[0,  3] 

#  UGVs 

[0,  3] 

#  MAVs 

[0,  10] 

Search  algorithm 

{g=greedy,  s=segmented} 

On  Demand  Recharging 

{true,  false} 

Mobile  Charger 

{true,  false} 

Outcome  ( O ) 

Coverage 

[0%,  95%] 

Energy  Consumption 

[0,  8,244,000]  Joules 

A  case’s  solution  S  contains  the  settings  for  six  parameters  that  define  how 
to  apply  the  SDP  to  P.  These  include  the  number  of  each  type  of  unmanned 
vehicle  to  deploy  (i.e.,  STUASs,  UGVs,  and  MAVs),  the  type  of  area  coverage 
search  algorithm  to  use  (i.e.,  Greedy  or  Segmented),  whether  to  use  On  Demand 
Recharging,  and  whether  to  deploy  a  Mobile  Charger  (for  UGVs).  Finally,  the 
outcome  attributes  O  are  the  scenario  metrics  described  in  Section  4.5. 

In  our  empirical  study,  our  case  library  includes  multiple  cases  with  the  same 
problem,  but  differ  in  their  solutions,  because  our  simulator  is  non-deterministic. 


5.2  Case  Retrieval 

Our  parameter  selection  algorithm  retrieves  cases  in  two  phases.  In  the  first 
phase  it  computes  the  similarity  of  a  given  query  (whose  attributes  are  those 
listed  for  Problems  in  Table  1)  with  the  distinct  problems  among  cases  in  a 
library  L,  and  retrieves  those  cases  L'  £  L  whose  similarity  exceeds  a  threshold 
f,  where  the  similarity  of  a  query  q  and  a  case’s  problem  c.P  is  defined  as  follows: 

sim(q ,  c.P )  =  rp  1  ~  ^ n  C’Pa\  (1) 

r  I  a^p  amax 

where  amax  is  the  largest  possible  absolute  difference  among  two  values  of  an 
attribute  a. 

In  the  second  phase,  L'  is  filtered  to  remove  cases  whose  performance  metrics 
are  low.  This  phase  is  needed  to  remove  cases  whose  problems  may  have  had  a 
poor  sampling  bias.  Filtering  is  done  using  a  function  that  scores  the  outcomes 
of  each  case  c  £  L'  relative  to  every  other  case  in  L' .  We  score  each  case  c  by 
calculating  a  Student  t-statistic4  for  its  Efficiency  relative  to  some  set  of  cases 

4  We  use  the  Student  t-statistic  because  the  population  statistics  of  our  non- 
deterministic  scenarios  are  unknown,  and  we  have  verified  that  Coverage  and  Energy 
Consumption  are  both  normally  distributed. 


Algorithm  1:  Case  Reuse  Algorithm  for  Setting  Parameter  Values 
Inputs:  Query  q,  Cases  Nq 

Returns:  Solution []  //  A  vector  of  parameter  settings  for  q 
Legend: 

Nq  //  q’s  ^-Neighborhood  of  cases 

s  //  A  parameter  among  those  in  q’s  predicted  solution  S 
Votes[]  //  Summed  weighted  scores,  indexed  by  parameter 
SumWeights\\  //  Summed  weights,  indexed  by  parameter 

SetParameters(g,  Nq)  = 
foreach  s  £  S  do 

Votes[s ]  =  SumWeights[s]  =  0; 

foreach  c  £  Nq  do 
foreach  s  £  c.S  do 

weight  =  sim(g,c.P); 

Votes[s]  +=  weight  x  scor e(c,Nq)-, 

_  SumWeights[s]  +=  weight-, 

foreach  s  £  c.S  do 

//  Compute  value  for  parameter  s  with  the  highest  weighted  vote 

Solution[s]  =  maxParamValue(Vote.s[s]/«S'wmVFei<?hts[s]); 

return  Solution\\-, 


C  as  follows: 


score{c,  C ) 


(2) 


where  ce  is  the  efficiency  of  case  c,  Ce  is  the  mean  efficiency  of  all  cases  in  C, 
and  s  is  the  sample  deviation  of  C. 

For  each  case  c  £  L' ,  we  compute  its  score(c,L').  For  each  subset  of  cases 
Lp  C  L'  with  problem  p,  we  identify  its  n%  most  Efficient  cases  and  compute 
their  mean,  denoted  by  mean (L',p,  n).  We  then  rank  these  mean  values  across 
all  problems  p  represented  in  L' ,  identify  the  problems  P'  with  the  k  highest 
values  of  mean(L',p,  n),  and  return  all  cases  with  a  problem  p  £  P' .  This  yields, 
for  query  q,  a  neighborhood  of  retrieved  (and  filtered)  cases  Nq. 


5.3  Case  Reuse 

SDP  uses  Algorithm  1  for  case  reuse.  It  computes  a  similarity-weighted  vote 
among  the  retrieved  cases  Nq,  where  a  vote  of  a  case  c  £  Nq  is  the  product  of 
its  problem’s  similarity  to  q  and  its  scor e(c,Nq)  as  defined  in  Equation  2. 

Our  HADR  sceanrio  simulator  is  non-deterministic.  Thus,  the  metrics  com¬ 
puted  for  a  given  (problem, solution)  pair  can  vary  each  time  a  scenario  is  ex¬ 
ecuted.  To  account  for  this  we  compute  these  metrics  using  their  mean  values 
across  a  set  of  scenario  executions. 


Given  a  query  q  and  a  set  of  cases  Nq,  Algorithm  1  computes  a  score  for  each 
case  c  G  Nq  (the  fc-Neighborhood  of  q)  and  then  computes  the  weighted  vote  of 
each  parameter  in  q's  solution  vector  S.  Our  algorithm  allows  for  all  cases  in  Nq 
to  contribute  votes  to  the  adapted  solution,  and  assumes  that  the  parameters 
are  independent.  It  weights  each  vote  by  the  similarity  of  c.P  to  q1  giving  more 
(normalized)  weight  to  cases  that  are  more  similar  to  query  q.  Finally,  it  returns 
a  solution  (i.e. ,  a  vector  of  parameter  settings). 

6  Empirical  Study 

We  empirically  tested  four  research  hypotheses: 

HI  Solutions  obtained  using  problem  knowledge  can  outperform  random  solu¬ 
tions. 

H2  No  single  solution  performs  optimally  on  all  problems. 

H3  Using  a  case-based  approach  to  set  parameters  yields  solutions  that  perform 
well  in  comparison  to  a  known  good  solution. 

H4  Case  adaptation  increases  performance  metrics. 

In  the  following  sections  we  describe  the  metrics,  simulation  data,  empirical 
method,  algorithms  tested,  the  results,  and  their  analysis. 

6.1  Metrics 

We  initially  planned  to  use  the  (raw)  outcome  metrics  in  O  listed  in  Table 
1,  namely  Coverage  and  Energy  Consumption.  However,  neither  metric  alone 
provides  comprehensive  insights  for  the  results.  This  motivated  us  to  introduce 
Efficiency  (Section  4.5),  which  we  use  for  results  analysis. 

6.2  Simulation  Data 

We  conducted  our  study  using  MASON  [9],  a  discrete-event  multiagent  simula¬ 
tor  that  models  all  physics  behaviors  and  physicomimetics  control.  A  problem 
scenario  in  MASON  is  defined  by  the  problem  attributes  P  (see  Table  1),  the 
locations  and  sizes  of  the  Aols,  and  the  AUVs’  starting  locations.  (We  did  not 
include  these  latter  attributes  in  P  due,  in  part,  to  their  high  variance.)  Running 
MASON  scenarios  also  requires  setting  the  parameters  in  S  (e.g.,  the  number 
of  AUVs  per  platform  type,  the  type  of  area  search  algorithm  to  use).  Running 
a  parameterized  MASON  scenario  yields  the  set  of  outcomes  in  O.  MASON  is 
non-deterministic;  these  outcomes  can  vary  each  time  it  runs  a  parameterized 
scenario  because  it  executes  the  actions  of  multiple  agents  in  a  random  order. 

To  generate  the  case  library  L  for  our  experiments,  we  created  a  problem 
scenario  generator  that  outputs  random  scenarios  (according  to  a  uniform  dis¬ 
tribution)  over  the  attributes  for  problems  and  solutions,  using  the  ranges  shown 
in  Table  1.  We  used  it  to  generate  20  problem  scenarios  and  paired  each  with  100 
solution  vectors  to  yield  the  P  and  S  values  for  2000  cases.  We  then  executed 
each  (Scenario, Solution)  pair  in  MASON  10  times,  and  recorded  the  average 
values  of  the  outcome  metrics  ( O )  with  each  case. 


Table  2.  Test  Scenario  Problems 


Problem  # 

Disjoint  Aols 

AO  Diagonal  Distance 

Aols’  Area 

Coverage  Decay 

1 

1 

1621 

259972 

2 

7 

1553 

396580 

No 

3 

6 

1400 

375275 

4 

6 

1001 

69279 

5 

4 

1332 

130684 

6 

10 

1451 

285652 

7 

5 

636 

14452 

Yes 

8 

3 

1773 

197925 

9 

10 

1222 

165494 

10 

3 

1833 

194512 

6.3  Empirical  Method 

We  tested  our  CBR  algorithms  (Section  6.4)  using  a  partial  cross-validation 
strategy.  In  particular,  we  selected  the  first  10  problem  scenarios  (see  Table  2) 
among  the  20  generated,  and  performed  leave-one-out  testing.  That  is,  when  we 
tested  a  problem  scenario  pi ,  we  temporarily  removed  the  100  cases  in  L  with 
problem  p,  for  testing,  leaving  1900  cases  in  L.  We  repeated  this  cross-validation 
10  times  to  generate  the  data  for  our  analyses. 

The  time  required  to  generate  L  was  four  days,  even  after  parallelizing  the 
10  trials  per  problem  and  dividing  the  problems  among  two  multi-cpu  machines. 
Thus,  time  constraints  prevented  us  from  running  a  complete  leave-one-out  test 
on  all  20  problems,  which  we  leave  for  future  work. 


6.4  Algorithms  Tested 

We  tested  two  CBR  algorithms  that  vary  in  whether  they  perform  case  adap¬ 
tation/reuse.  They  both  apply  the  case  retrieval  algorithm  described  in  Section 

5.2,  where  we  set  k  =  3,  n  =  20%  and  t  =  75%.  (Values  were  not  optimized). 

CBRAr(Non-adapted)  This  retrieves  Ng,  selects  a  case  c  £  Nq  that  has  the 
maximum  scor e(c,Nq),  and  executes  MASON  using  the  solution  vector  c.S. 

CBR^  (Adapted)  After  case  retrieval,  this  applies  Algorithm  1  to  q  and  Nq. 
It  outputs  solution  vector  S,  which  is  given  to  MASON  to  execute. 

We  also  included  the  following  three  baselines  in  our  experiments: 

Random  Mean  (R)  This  does  not  require  simulation.  Instead,  for  a  given  sce¬ 
nario,  it  locates  the  100  cases  in  L  whose  problem  P  matches  the  scenario, 
and  yields  the  mean  value  of  their  outcome  metrics. 

Best  Overall  (O)  This  evaluates  how  well  a  single  good  solution  performs 
across  all  problems.  It  finds  the  case  c  £  L  whose  solution  c.S  has  the 
highest  score(c,L)  and  applies  it  to  every  problem  in  the  test  set. 
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Fig.  5.  Performance  of  CBRa  (A),  CBRn  (N),  O,  and  S  on  Coverage,  where  higher 
values  are  preferred,  and  Energy  Consumption,  where  lower  values  are  preferred 


Best  Sampled  (S)  This  is  designed  to  produce  a  good  known  solution  for  any 
query.  It  locates  the  100  cases  L'  C  L  whose  problem  matches  the  given 
scenario,  finds  the  case  c  £  L'  with  highest  score(c,  L'),  and  returns  its 
outcome  metrics  c.O. 


6.5  Results  and  Analysis 

HI:  Figure  5  displays  plots  for  Coverage  and  Energy  Consumption  for  the  test 
problems.  Using  a  one-tailed  t-test,  we  found  that  R  consistently  recorded  signif¬ 
icantly  lower  Efficiency  (not  shown  for  space  reasons)  than  the  other  algorithms, 
which  supports  HI.  For  Coverage,  S  performed  significantly  better  on  6  scenar¬ 
ios,  O  9  times,  CBRn  4  times,  and  CBRa  6  times.  For  Energy  Consumption, 
these  algorithms  significantly  outperformed  R  for  all  scenarios  (except  problem 
4  for  CBRn  and  O).  Given  this,  we  will  ignore  R  for  the  rest  of  this  section. 

H2:  We  expected  that  O  would  be  consistently  outperformed  by,  or  perform 
simmilarly  to,  the  remaining  algorithms  for  the  9  test  problems  (O’s  solution 
came  from  a  case  with  Problem  1,  which  we  excluded  from  testing).  Compar¬ 
ing  Efficiency,  S  significantly  outperformed  O  on  8  test  problems,  while  CBRn 
{CBRa)  significantly  outperformed  O  on  4  of  the  6  problems  in  which  Coverage 
Decay  exists  (primarily  due  to  lower  Energy  Consumption).  These  results  sup¬ 
port  this  hypothesis  (i.e.,  that  one  solution  does  not  perform  well  across  all  test 
problems  and  tailoring  of  solutions  is  preferable) ,  particularly  for  when  Coverage 
Decay  (i.e.,  higher  state  uncertainty)  exists. 

H3:  The  results  support  this  hypothesis,  especially  for  Coverage  Decay  problems. 
For  example,  when  comparing  Efficiency  CBRa  significantly  outperformed  S  on 


Table  3.  Solutions  generated  by  algorithms  S,  CBRn  (N),  and  CBRa  (A)  for  prob¬ 
lems  1-10  (where  t  for  Greedy  indicates  that  Greedy  search  was  used,  Recharging  means 
On  Demand  Recharging,  and  Mobile  C  means  a  Mobile  Charger  was  used) 
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problems  {6,7,9,10}  while  CBRN  outperformed  S  on  {5, 6, 7, 9}.  We  discuss  this 
further  in  Section  7. 

H4:  We  analyzed  whether  CBRa  outperformed  CBRn ,  which  seems  to  be  in¬ 
dicated  by  Figure  5.  For  Coverage,  CBRa  significantly  outperforms  CBRn  on 
4  problems,  though  sometimes  at  the  expense  of  Energy  Consumption  (e.g.,  for 
problem  3).  However,  while  the  graphs  suggest  some  advantages  to  case  reuse, 
the  significance  tests  are  inconclusive. 


7  Discussion 

The  results  are  challenging  to  analyze  because  this  is  a  multi-objective  problem. 
Although  Efficiency  combines  them,  it  treats  both  outcome  metrics  equally,  in¬ 
dependently  of  whether  this  is  preferable.  The  CBR  algorithms  performed  well 
on  Coverage,  but  were  mediocre  on  Energy  Consumption  compared  to  S  on 
problems  1-4,  which  are  not  Coverage  Decay  problems.  However,  while  S  is  sup¬ 
posed  to  be  a  good  solution,  the  plans  generated  by  the  CBR  algorithms  perform 
comparably  without  cheating  (i.e.,  training  on  test  problems). 

Table  3  displays  the  solutions  generated  by  S,  CBRNl  and  CBRa  for  each 
problem.  (Baseline  O  always  chose  2  MAVs,  0  UGVs,  3  STUASs,  Segmented 
search,  On  Demand  Recharging,  and  no  Mobile  Charger.)  The  large  variance 
in  Coverage  (Figure  5)  is  partially  caused  by  solutions  that  use  few  STUAS 
(Table  3).  This  is  due  in  part  to  a  physics  model  that  permits  STUAS,  in  some 
conditions,  to  depart  (i.e.,  fly  out  of)  an  Aol.  However,  Coverage  variations  are 
much  smaller  for  problems  {6,7,9}  than  problems  {5,8},  even  though  they  all 
use  only  one  STUAS.  This  is  because  the  latter  problems  have  fewer  disjoint 
Aols,  which  may  reduce  the  frequency  with  which  STUAS  depart  the  map.  We 
will  address  this  issue  in  future  work. 


For  Coverage  Decay  problems,  95%  Coverage  becomes  nearly  impossible  to 
obtain,  and  the  CBR  algorithms  converge  to  using  one  STUAS  to  increase  Effi¬ 
ciency  (i.e. ,  low  Energy  Consumption  and  large  Coverage).  Operating  multiple 
AUVs  yields  diminishing  benefits  (i.e.,  marginally  larger  Coverage  at  a  higher 
Energy  Consumption),  although  an  operator  may  prefer  higher  Coverage.  We 
will  further  explore  what  metrics  to  use  in  future  work. 

8  Conclusion 

To  our  knowledge,  this  is  the  first  study  that  uses  CBR  to  set  parameters  that 
control  the  behavior  of  a  heterogeneous  AUV  team  (here,  to  perform  situation  as¬ 
sessment  for  HADR  missions) .  Our  simulation  study  showed  that  our  case-based 
algorithm  can  perform  well  on  this  task  (i.e.,  outperforms  a  random  approach  as 
well  as  the  parameter  values  found  to  perform  best  for  any  scenario,  and  performs 
comparably  to  the  best  known  solution  for  a  given  problem  scenario).  While  case 
adaptation  tends  to  improve  performance,  the  increases  are  not  significant. 

Our  simulation  models  only  some  of  the  real-world  environment’s  character¬ 
istics  (e.g.,  it  ignores  wind,  ground  cover,  and  vehicle  overhead  costs),  which 
could  impact  our  results.  For  example,  substantial  ground  cover  could  degrade 
the  data  collected  by  a  STUAS,  and  increase  reliance  on  the  other  two  platforms. 
Adding  these  parameters  would  make  pre-calculation  of  solutions  for  the  entire 
problem  space  infeasible,  and  strongly  motivate  the  need  for  case  retention. 

Our  case  reuse  algorithm  makes  a  strong  independence  assumption  and  is 
sensitive  to  small  training  sets.  We  will  study  other  approaches  for  adapting 
cases  such  as  using  stretched  neighborhoods  for  each  dimension  [4] .  We  will  also 
study  methods  for  seeding  the  case  base  with  good  solutions  learned  from  multi¬ 
objective  optimization  functions.  Although  this  type  of  optimization  has  been 
used  previously  (e.g.,  [2]),  we  propose  to  use  it  to  seed  the  case  base  rather  than 
use  cases  to  seed  further  multi-objective  searches. 

Finally,  we  will  test  the  SDP  for  its  ability  to  help  an  operator  guide  a  team 
of  AUVs  in  outdoor  field  tests  under  controlled  conditions.  We  have  scheduled 
such  tests,  with  increasingly  challenging  scenarios,  during  the  next  three  years. 
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